課程大綱

課程資訊

課程名稱	社群媒介輿論分析 Social Media and Public Opinion Analysis
開課學期	107-2
授課對象	社會科學院新聞研究所
授課教師	謝吉隆
課號	JOUR7088
課程識別碼	342 M3040
班次
學分	3.0
全/半年	半年
必/選修	選修
上課時間	星期四2,3,4(9:10~12:10)
上課地點	新聞315
備註	社群媒介輿論分析之學生應有修過程式設計課程，本課程使用R。限本系所學生(含輔系、雙修生) 總人數上限：15人
Ceiba 課程網頁	http://ceiba.ntu.edu.tw/1072R
課程簡介影片
核心能力關聯	核心能力與課程規劃關聯圖
課程大綱
為確保您我的權利,請尊重智慧財產權及不得非法影印
課程概述	本課程將帶領學生利用資料科學方法，截取社群資料、分析、或者探索社群輿論背後的立場、階級、情緒、傳播策略、性別等議題。 1. 本學期課程目標是「以個人為單位」發表以文字探勘方法於傳播議題上之研究論文。 2. 本課程並非程式語言教學課程，學生將被要求依照所介紹的分析方法，就自己的研究議題自行上網找方法實作。 3. 如欲修課的同學，請於開學第一週前將三頁初步計畫書以E-mail寄送給開課教師（MUST）。初步計畫書內容須包含，研究主題、研究前沿（網路上別人做到哪裡，請找英文）、所欲處理的資料、個人的程式學習狀況概述、個人修課的需求、本學期其他修課概況。 4. 由於以專題式方法進行，故上課時數可能會延長一至二小時，如有下午課程者請勿選修。 5. 由於論文撰寫費時，預計將延後繳交成績一個月，故大四或研究所欲畢業的學生請勿選修。
課程目標	1. 學生具有應用文字探勘方法於傳播議題上的能力 2. 經過相關議題背景介紹後，學生有查找相關文獻的能力 3. 學生有能夠獨立實作文字探勘技術的能力
課程要求	1. Essay 60% (Proposal 15%; Initiative: 10%; Final work: 35%) - Proposal (at least 6 pages with font size 12 and single space in google docs. Required to search additional reference for supporting your idea) - Initiative(Required to meet with instructor personally at least 2 times, with preparation of handout, slides, and ongoing methods, experiments, or programming) - Final (Presentation 10%, Essay 25% at least 10 pages in APA format with correct reference style, and figure/table captions) 2. LLR (Leading Literature Review) 20% - Required to summarize (with handout) and lead course discussion for specific topics (1~2 times, 5% for the handout, 10% for leading discussion). - Remaining 5% for weekly literature review (Reading before class, proposing one question, and one related research article) 3. Implementing specific text mining for specific topic 20%: - Required to implement skills or algorithm for specific topics such as political communication, health communication, or user modeling (Examples of techs collocation with PMI, topic modeling and visualization, crawling wiki DB for improving tokenization, using word embedding for doc classification).
預期每週課後學習時數
Office Hours	備註： Mon. 14:00~16:00
指定閱讀	Ch 1~5, Manning, C. D., Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. MIT press. Ch 1~6, Weiss, S. M., Indurkhya, N., & Zhang, T. (2015). Fundamentals of predictive text mining. Springer.
參考書目	待補
評量方式 (僅供參考)

課程進度

週次	日期	單元主題
第1週	2/21	Course overview Preliminary proposal
第2週	2/28	228
第3週	3/07	Proposal
第4週	3/14	Chapter: Corpus and POS
第5週	3/21	Chapter: Collocation # AS1 (10%) Collocation algorithms for personal topics and network visualization
第6週	3/28	LLR: Fake news and detections
第7週	4/04	Spring break
第8週	4/11	Proposal: Final dataset, preprocessing, and descriptive reports
第9週	4/18	Word embeddings and text classification (demo) # AS2: (1) Building reserved vocabulary by wikipedia DB with crawler and HMM; (2) Verifying tokenization efficacy and accuracy; (3) Building personal word2vec CBOW and Skip-gram.
第10週	4/25	LLR: Clickbait and detection
第11週	5/02	LLR: Uses of topic modeling
第12週	5/09	Reporting implementation of text mining methods
第13週	5/16	LLR: Political discourse analysis with text mining
第14週	5/23	Proposal: Methods
第15週	5/30	LLR: Tweet analysis
第16週	6/06	LLR: Tweet analysis
第17週	6/13	Reporting project results
第18週	6/20	Final review and discussion